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Abstract 

Halogen bonding in ligand-protein complexes is currently widely exploited, e.g. in drug design or supramolecular chemistry. 
But little attention has been directed to other effects that may result from replacement of a hydrogen by a strongly 
electronegative halogen. Analysis of almost 30000 hydrogen bonds between protein and ligand demonstrates that the 
length of a hydrogen bond depends on the type of donor-acceptor pair. Interestingly, lengths of hydrogen bonds between 
a protein and a halogenated ligand are visibly shorter than those estimated for the same family of proteins in complexes 
with non-halogenated ligands. Taking into account the effect of halogenation on hydrogen bonding is thus important when 
evaluating structural and/or energetic parameters of ligand-protein complexes. All these observations are consistent with 
the concept that halogenation increases the acidity of the proximal amino/imino/hydroxyl groups and thus makes them 
better, i.e. stronger, H-bond donors. 
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Introduction 

Specific non-covalent interactions of low-mass ligands with 
proteins drive properties of the enzymatic machinery in a living 
cell. According to (induced) key-lock theory (see [1] for review) a 
low-mass ligand should fit to a dedicated binding site that is 
accessible on the protein surface. This steric compatibility, known 
as van der Waals interactions, dominates ligand-protein selectivity, 
simply excluding the majority of putative ligands and favoring 
these that fit to the protein binding site. Other types of interactions 
modulate the stability of ligand-protein complexes. The strongest 
ones are electrostatic interactions between charged groups (known 
as salt-bridges, formally zero momentum in multipole expansion of 
electrostatic interactions), which energy frequently exceeds 
10 kcal/mol. 

Hydrogen bonding is the next type of interactions proven to 
contribute significantly to stabilization of protein structure and to 
the organization of intermolecular complexes (ca. 3 to 5 kcal/mol). 
The energy of a single hydrogen bond (H-bond) in ligand-protein 
complexes depends both on the type of hydrogen bond donor (D) 
and acceptor (A) and on the overall geometry of the D-H"»A 
system. The shortest H-bonds are observed for oxygen acting as a 
donor, 0H.~0 (2.70A) and OH."N (2.88A), respectively. When 
nitrogen is an H-bond donor, its distance to an acceptor is longer: 
NH".0 (3.04A) and NH".N (3.10A), respectively [2]. Subse- 
quently, numerous non-canonical weak H-bonds have been 



identified by statistical analyses of protein structures, and 
previously identified in crystals of low-mass compounds. This 
includes, amongst others, a n electron system acting as an H-bond 
acceptor [3-5], and an aliphatic carbon acting as an H-bond 
donor [4,6,7]. 

During the last decade, halogen bonding (X-bond, see [8] for 
review) has been recognized to play a similar role as H-bonding in 
protein-ligand complexes. Halogen bonds have been identified in 
many crystal structures of low-mass compounds and their 
supramolecular ensembles [8-17], as well as in complexes of 
biomolecules with halogenated ligands [18-20]. Bearing in mind 
that numerous natural drugs, and an increasing number of 
synthetic drug candidates, are halogenated [21-23]; understand- 
ing the nature and thermodynamics of halogen bonding should 
contribute to rational drug design. Currently, halogenated 
compounds are widely used in screening libraries, and comprise 
almost 20% of low-mass protein ligands listed in the Protein Data 
Bank (PDB). The role of halogenated ligands in biological systems 
has been widely reviewed, amongst others, by AufTmger et al. [18], 
Parisini et al. [24], Rendine et al, Voth & Ho [25], Voth et al. [26], 
Wilcken et al. [27] and Poznanski & Shugar [28]. 

However, there is some controversy about the energy of a 
halogen bond. In aqueous medium estimates of intra- or 
intermolecular halogen bonds vary from 0.2 [29] up to 5— 
8 kcal/mol [25], suggesting that, in biological systems, halogen- 
and hydrogen bonds may be of similar strength. However, the 
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apparently largest values for an X-bond were obtained ab initio for 
CF3-X".NH3 systems: 2.3, 4.7 and 6.4 kcal/mol for X = C1, Br 
and I, respectively [30]. These values agree with energies 
estimated by IR spectroscopy, for CF3-X"«N(CH3)3 in liquid 
noble gases, which are the best models for a non-polar solvent that 
does not interfere with solute-solute interactions: 2.1, 4.4 and 
6.8 kcal/mol for X-bonds involving CI, Br and I, respectively [31- 
33]. These halogen bonds can compete with hydrogen bonding, as 
well documented for numerous low-mass complexes i?i silica 
[34,35], in solution [36,37], and in the solid state [17,38,39]. 

Due to this revived interest in halogen bonding, the observed 
effect of a halogen atom on structural stability [25,40], or ligand 
binding [41,42], has been attributed to a direct effect of halogen 
bonding only. However, the strong electronegative and hydro- 
phobic character of halogen atoms may also contribute to intra- 
and intermolecular interactions. For example we have recendy 
shown that inhibitory activities (IC-jo) against protein kinase CK2a 
observed for a series of benzotriazoles brominated on the benzene 
ring can be explained by a balance of hydrophobic and 
electrostatic interactions [43]. 

Halogenation modulates electron density on proximal donors 
and acceptors of hydrogen bonds [44], as well as changes in 
protonation equilibria of proximal dissociable groups [45,46]. 
Well-known examples include the decrease in pK^ of fluorinated 
alcohols [47]: ethanol vs. 2,2',2"-trifluoroethanol (pK^ decrease by 
3.45) and phenol vs. pentafluorophenol (pKj^ decrease by 4.4). 

Likewise, halogenation of uracil was shown to reduce the 
hydrogen-bond-accepting, and to increase the hydrogen-bond- 
donating, capabilities of halogenated DNA bases [48-50] . Other 
illustrative examples of the direct effect of a halogen atom on 
strengthening of proximal hydrogen bonds are brominated natural 
[51,52] and synthetic [53-55] DNA, which were found to be 
much more stable than the corresponding non-brominated 
analogues. 

A further example of the foregoing is the report of Xu et al. [42] 
on a series of closely related halogenated inhibitors of phospho- 
diesterase 5 (PDE5). There are five PDB structures of PDE5 with 
bound inhibitors that differ only by substitution of a hydrogen 
atom by F, CI, Br or I, respectively (see PDB entries 3TSE, 3SHY, 
3SHZ, 3SIE, 3TSF). Location of these closely related ligands in 
the binding pocket was judged to be stabilized, besides two 
hydrogen bonds and numerous vdW interactions, by intermolec- 
ular interaction between the halogen atom pC) and the hydroxyl 
oxygen of Tyr612. However, there are also two intermolecular 
hydrogen bonds between the side-chain of Gln817, and ligands 
Od and Ns, respectively, both proximal to the halogen atom (3 
chemical bonds distance). Changes in the lengths of these, upon 
variation of the halogen substituent, reflects eventual strengthening 
of these H-bonds, not taken into account by the authors [42]. 

To our knowledge, no high-throughput analyses addressing the 
effect of a halogen atom on proximal hydrogen bond(s) have yet 
been reported for ligand-protein systems [13,19,24,25,56-61]. We 
herein analyze the effect of the halogen atom of a halogenated 
hgand on the lengths of hydrogen bonds (both proximal and 
distal), identified in two families of proteins: protein kinases (EC 
2.7) and acyltransferases (EC 2.3). 

Results and Discussion 

1 PDB Screening 

To avoid the eventual effect of protein-specific ligand binding 
modes, two protein families were analyzed. Protein kinases (EC 
2.7) and acyltransferases (EC 2.3) are the proteins for which the 
largest number of structures with halogenated ligands was 



identified in the PDB. All complexes of ligands with proteins of 
these two families were analyzed. A total of 3852 PDB entries was 
found, 3187 with non-halogenated ligands, LH, 505 with 
fluorinated ones, LF, and 408 containing halogenated (not 
fluorinated) ligands, LX, contributing together to 1228 records 
of acyltransferases and 2624 records of prot(dn kinases. After 
exclusion of protein sulfur as either hydrogen bond acceptor or 
donor, a total number of 24470 hydrogen bonds was identified, 
1930 with fluorinated, 1390 with halogenated ligands, and 21 150 
with non-halogenated ligands, respectively. In addition, 41 
intermolecular H-boiids to prott'in sulfur (Met or Cys) were 
excluded from further analyses (see Table 1 for the short statistics). 

2 Distribution of Hydrogen Bond Lengths as a Function 
of H-bond Topology 

Hydrog(;ii bonds were grouped according to eight possible 
topologies of hydrogen bond donor-acceptor pairs, i.e. N- 
Hug~-Op„t (NH.~0), Nug~.H-Op,o, (N~-HO), 0-Hug~.Op,o, 
(OH...O), Oug...H-Op„. (O...HO), N-H,-..Np„„ (NH-N), 
N,ig...H-Np,„, (N."HN), 0-H„g".Np„„ (OH-N), and 0,-~H- 
Nprot (0»"HN). The distributions obtained for non-halogenated 
ligands are presented in Figure 1. For ligands acting as H-bond 
donors, two left shifted distributions identify the topologies of the 
shortest H-bonds types (i.e. OH«~N and 0H~»0), medians of 
which are the lowest (Figure lA). The intermediate distribution 
(NH«"0) is also characterized by a higher median. The last one 
(NH~«N) is much more shifted to the right, and its median is the 
highest. The order of distributions strictly correlates with the 
average strength of H-bonds: the shorter is donor-to-acceptor 
distance, the stronger is the H-bond. This agrees with a common 
order of an average enthalpy of formation of various types of 
hydrogen bonds in biomolecules: OH~»N, 6.9 kcal/mol; 0H~'0, 
5.0 kcal/mol; NH— N, 3.1 kcal/mol, and NH~0, 1.9 kcal/mol, 
respectively [62]. 

Inspection of cumulative distributions for non-halogenated 
hgands acting as H-bond acceptors (see Figure IB) clearly 
demonstrated that hydrogen bonds involving two oxygen atoms 
are statistically the shortest, as evidenced by the left-shift of the 
cumulative distribution function towards shorter distances (and 
also smaller medians). H-bonds between two nitrogen atoms are 
the longest, and the two remaining types of hydrogen bonds of 
mixed topology, N~»HO and 0'~HN are intermediate in their 
lengths. 

Formal statistical analysis clearly demonstrated that most 
topologies of an H-bond differ significantly according to the H- 
bond donor-to-acceptor distance distribution. Interestingly, these 
differences are observed even between pairs in which the proton is 
swapped between the ligand and the protein. Hence, the 
distribution of 0H'~0 (OHiig»~Oprnt) differs from that of an 
□•••HO (0,-.^HOp„„), p<10"l Similarly OH^^^N differs from 
O^^^HN (p<-10"'") and NH^^^O differs from N^^-HO (p = 0.05). 
This significant asymmetry may be a consequence of overrepre- 
sentation of several types of H-bond donors and acceptors in 
proteins. Namely, oxygen acceptors are dominated by backbone 
carbonyls, oxygen donors are Ser, Thr and Tyr hydroxyl groups, 
nitrogen donors are mainly backbone amides, and nitrogen donors 
are solely the imidazole of His, which are rare in proteins. In 
consecjuence, the protein nitrogen H-bond acceptors are strongly 
underrepresented (see column 'n' in Table 2). 

The foregoing is vaKd for all types of ligands acting as either 
acceptor or donor of an H-bond (see Figure SI and Table SI). The 
statistical significance of the obs(;r\(-d differences in donor- 
acceptor distance distributions was evaluated, separately for the 
three types of ligands, with the aid of the non-parametric Kruskal- 
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Figure 1 . Cumulative distributions of donor-acceptor distances 
determined for various types of intermolecular hydrogen bond 
donor-acceptor pairs identified in complexes of proteins with 
non-halogenated ligands, in which the ligand is either a 
hydrogen bond donor (A) or acceptor (B). 
doi:10.1371/joumal.pone.0099984.g001 



Wallis test (p<10~^). Since these differences were found globally 
significant, the post-hoc approach was used to identify those pairs 
that significantly differ. Estimated p-values, together with the 
number of identified H-bonds, and mean rank of donor-acceptor 
distances, are presented in Table 2 and Table SI. 

The majority of the analyzed pairs of distributions for non- 
halogenated ligands (LH) differ significantly (23 out of 28, 
assuming a significance level of 0.05). In the case of fluorinated 
(LF) and other halogenated ligands (LX), the small number of 
identified hydrogen bonds of the type N«"HO (Niig'"H-Op„,t with 
n = 4 or 2 H-bonds found for LH and LX ligands, respectively) 



and NH—N (NHiig—Np„,t with n = 1 and 0, respectively), 
precluded analysis of these two types of hydrogen bonds. For the 
remaining groups, distributions for 11 out of 14 possible pairs 
differ significantly both for fluorinated (LF) and otherwise 
halogenated (LX) ligands (Table SI). Li this context, the hydrogen 
bond lengths to halogenated or non-halogenated ligands must be 
compared separately for eight groups representing all possible 
topologies of hydrogen bonding in hgand-protein complexes. 
Otherwise, the differences in representation of various types of 
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hydrogen bonds would contribute in an uncontrolled manner to 
the observed distance distributions. 

The most common types of ligand-protein intermolecular 
hydrogen bonds in the PDB, NH~«0 and 0"»HN, display almost 
identical distributions for non-halogenated hgands (LH, see 
Figure 2A), but become visibly different for fluorinated (LF) or 
otherwise halogenated (LX) ligands (see Figure 2B, C). In both 
latter cases the distributions of the NH»"0 hydrogen bond lengtiis 
are shifted left relative to those of the 0»~HN. However, for 
fluorinated ligands the medians are, by chance, almost equal. The 
observed differences are statistically significant only for haloge- 
nated ligands (p = 0.03), but they are also probable for fluorinated 
ligands (p = 0.09) (see Table 2). It should be stressed that the 
observed differences in medians for fluorinated (LF) and haloge- 
nated (LX) ligands (0.01 and 0.03 A respectively, see Table 3) 
exceed the precision of PDB records. Overall, this clearly shows 
that ligand substitution with electronegative atoms (F, CI, Br, I) 
results in variation of the lengths of intermolecular hydrogen 
bonds. Moreover, this effect strongly depends on the type of 
hydrogen bond (Table SI). 

3 Hydrogen Bonding to Halogenated or Fluorinated vs. 
Non-halogenated Ligands 

The effect of a halogen atom on the distribution of hydrogen 
bond lengths was analyzed separately for the four most abundant 
types of hydrogen bonds: OH—O, NH—O, N—HN and O—HN 
(i.e. a protein oxygen being a hydrogen bond acceptor and a 
protein nitrogen being a hydrogen bond donor, see Table 3 for 
numbers). Cumulative distributions of hydrogen bond lengths 
estimated for fluorinated (LF) and otherwise halogenated (LX) 
ligands are, for some of the H-bond topologies, shifted towards 
shorter distances in comparison to non-halogenated ligands (LH). 
It is shown in Figure 3, and also confirmed by lower mean ranks 
collected in Table 3. Substitution with halogen atom mosdy affects 
the lengths of 0H"«0 hydrogen bonds (Figure 3C). Smaller, but 
still visible, changes are observed for NH»"0 (Figure 3A) and 
N"»HN (Figure 3B), while almost no variations are observed for 
0»"HN hydrogen bonds (Figure 3D). This is fully confirmed by 
the Mann-Whitney U test (see Table 3, and Table S2 for all H- 
bond topologies). Amongst them, hydrogen bonds to fluorinated 
ligands (LF) are significantly shorter for five out of seven tested 
pairs of distributions, while halogenated ligands (LX) differ 
significandy from non-halogenated ones (LH) only for the NH~«0 
type. It should be stressed that the medians for H-bond length with 
halogenated ligands (either LF or LX) are generally lower than 
those for non-halogenated ones (LH). This can also be easily 
checked via mean ranks (LX<LH and LF<LH). 

In general the effect of fluorine vs. other halogens atoms follows 
the electronegativity scale. Fluorine changes properties of nitrogen 
both as acceptor and donor of hydrogen, and oxygen as donor of a 
hydrogen bond, whereas chlorine, bromine and iodine affect only 
hydrogen bond donors (both oxygen and nitrogen). The latter 
effect is clearly detectable for medians (decrease by 0.06 and 0.03 
A for 0H»"0 and NH»"0, respectively) and is statistically 
significant (p = 0.02 and p<10 see Table 3 for details). No 
variations are observed for halogenated ligands (LX) acting as H- 
bond acceptor. It is worth noting that the observed differences 
agree with ah initio simulations of base pairing of halogenated 
uracil with adenine [48] . 
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Don***Acc distance [A] 

Figure 2. Cumulative distributions of NH">0 (blue) and 0-~l-IN 
(red) intermolecular hydrogen bonds identified in protein 
complexes with non-halogenated (A, LH), fluorinated (B, LF) 
and otherwise halogenated ligands (C, LX). See Table 2 for details. 
doi:1 0.1 371 /journal.pone.0099984.g002 



4 Validation of the Differences Observed for Distributions 
of H-bond Donor-acceptor Distance 

Proteins form H-bonds via various types of donors and 
acceptors. In view of the adopted approach, all types of hydrogen 
bonds should be analyzed separately, but the number of actually 
identified interactions with halogenated ligands makes results of 
such detailed analysis statistically insignificant. However, the large 
subset of H-bonds between ligands and protein backbone (i.e. 
carbonyl oxygen and amide nitrogen) enables analysis of much 
more homogenous subsets of protein-ligand interactions. The 
results generally agree with those obtained for all protein H-bond 
acceptors and donors (see Figure S2), confirming again the 
statistical significance of the effect of a halogen atom on lengths of 
intermolecular hydrogen bonds between a halogenated ligand and 
a protein. 

A key point of the presented analysis is the significance of the 
results presented in the context of the quality of PDB structures. In 
fact, there are only a very limited number of X-ray structures of 
closely related halogenated ligands bound to the same protein that 
can be compared directly (e.g. PDE5 mentioned in the Introduc- 
tion). Moreover, the resolution of X-ray structures precludes any 
direct interpretation of distances that differ by an order of 0.01 A. 
AH donor-acceptor distances must be regarded biased, but 
differences between observed distributions, as presented in 
Figures 1-3, may be considered as significant, since there is no 
factor explaining any systematic differences in biases for haloge- 
nated and non-halogenated ligands. However, to assess an 
eventual effect of quality of structures on the significance of 
observed differences in distance distributions, the analyses were 
repeated for two subsets of high-resolution X-ray structures, 
resolutions of which were better than 2.0 and 1.5 A, respectively, 
and the general tendency to strengthening of H-bonds between 
protein and halogenated ligands (both LF and LX) acting as 
hydrogen bond donor was preserved (see Table S3). 

Materials and Methods 

Structural Data 

The Protein Data Bank (PDB, [63]) was searched to identify all 
entries of protein kinases (EC 2.7) and acyltransferases (EC 2.3). 
Those containing ligands with at least one oxygen/nitrogen bound 
to a carbon atom were subjected to further analysis. 

Structural Analysis 

AU analyses were performed with the aid of the Yasara Model 
package [64] . For each class of protein, all intermolecular ligand- 
protein hydrogen bonds were identified, using 3.5 A as a threshold 
for the distance between putative hydrogen bond donor and 
acceptor. The distributions of donor-acceptor distances were 
determined separately for three classes of ligands: non-halogenated 
(LH), fluorinated (LF) and others that are halogenated, but not 
fluorinated (LX). These data were then assigned to one of eight 
groups, according to the topology of the hydrogen bond. The 
latter was defined according to the ligand atom (oxygen or 
nitrogen) being either donor or acceptor of a hydrogen bond with 
protein nitrogen or oxygen. Since fluorinated ligands (LF) are the 
internal reference for the effect of other halogen atoms that may 
contribute in halogen bonding (LX), all heterogenic ligands, which 
were simultaneously fluorinated and modified with chlorine/ 
bromine/iodine, were excluded from the analysis. 

The most abundant types of hydrogen bonds (i.e. NHiig'^Opj,,,, 
OHi;g«"Op„rt, Oiig"»HNp„,t, and Ni;g"'HNp„,t) were additionally 
analyzed according to homogenous substitutions with only 
Fluorine; Chlorine, Bromine or Iodine. All heterogeneously 



PLOS ONE I www.plosone.org 



6 



June 2014 | Volume 9 | Issue 6 | e99984 



Halogen Atom Affects Hydrogen Bonding 



CDF 



0.8 



0.6 



0.4 



0.2 



0.0 




CDF 



2.0 



2.5 3.0 
Don"»Acc distance [A] 



3.5 




2.5 3.0 
Don"»Acc distance [A] 



3.5 



CDF 



0.8 



0.6 



0.4 



0.2 



0.0 




CDF 



0.8 



0.6 



0.4 



0.2 



0.0 




2.0 2.5 3.0 

Don"»Acc distance [A] 



3.5 



2.0 2.5 3.0 

Don»"Acc distance [A] 



3.5 



Figure 3. Effect of a halogen atom on cumulative distributions determined for tiie four most abundant types of hydrogen bond 
donor-acceptor pairs: NH»"0 (A), OH—O (B), N—HN (C), and O—HN (D), respectively. The distributions estimated for non-halogenated (LH), 
fluorinated (LF) and other halogenated ligands (LX) are presented in blacl<, blue and green, respectively. See Table 3 for details. 
doi:1 0.1 371 /journal.pone.0099984.g003 



substituted ligands (e.g. bromo-fluoro or chloro-iodo) were 
excluded from this analysis. 

Multiple protein molecules in the crystal cell, as well as objects 
displaying partially occupied forms (i.e. side-chain rotamers or 
ligand locations) were analyzed separately. Hydrogen bonds with 
water molecules were not analyzed. 

Statistical Analysis 

To circumvent the eventual requirement of categorization, all 
distributions are presented in a cumulative manner as a CDF 
(cumulative distribution function), which is the integral of a 
distribution function. This form of presentation helps in visual 
comparison of various distributions, overcoming the problem of 
balancing the resolution (i.e. the number of beans in a histogram) 
and statistical noise (i.e. numbers of counts in beans). In all figures 
the curve most shifted to the left identifies the dataset character- 
ized by the shortest donor-acceptor distances. 

Since, according to the Anderson-Darling test [65], most 
distributions of hydrogen bond donor-acceptor distances were 



found not Gaussian (data not shown), the statistical significance of 
observed differences was estimated according to nonparametric 
tests. For comparison of two distributions the Mann- Whitney U 
test [66] was used. The Kruskal-WaUis test [67], which is a 
generalization of the U-test, was applied for 3 or more groups. 

The Mann-Whitney U-test is a first choice alternative to 
Student's t-test, when applied to two data-sets that are not 
necessarily normally distributed. Formally, it detects differences in 
shape of tested distributions: each group is characterized by its 
mean rank, i.e. the average position of its components in the list 
created by sorting both datasets. For each pair of distributions, the 
smaller value of the mean rank (Rj) identifies the group that is 
characterized by a shorter distance (see Tables 2, 3). The value of 
U; is the corresponding test statistics, (U; = n;*[R;— (n;+l)/2]; u; is 
the size of dataset ij, and Zu is the associated value of the standard 
Gaussian distribution. Positive Zu value (equivalently higher mean 
rank for LH) indicates that distances for halogenated ligands are 
shorter, and the corresponding p-value estimates the statistical 
significance of observed differences. The medians were also 
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compared for selected pairs of distributions according to the 
appropriate median test [68]. 

All analyses were performed using the Statistica 10 [69]. Null 
hypotheses that given distributions do not dilfer one from the other 
were tested at a significance level, a = 0.05, and those with p- 
values below 0.05 were rejected, and distributions regarded as 
different. 

Conclusions 

Hydrogen bond length distributions in protein-ligand com- 
plexes are significandy different for non-halogenated ligands 
(LH) compared to halogenated ones (LF, LX). The H-bond 
donor-acceptor distances are significantly shorter for a haloge- 
nated ligand acting as a hydrogen bond donor (at significance 
level 0.05). However H-bond lengths seem irrelevant for 
halogenations, when the ligand oxygen is a hydrogen bond 
acceptor. All these observations are consistent with the idea that 
halogenation increases the acidity of proximal amino/imino/ 
hydroxyl groups and thus makes them better, i.e. stronger, H- 
bond donors. 

Supporting information 

Figure SI Cumulative distributions of donor-acceptor distanc- 
es determined for various types of intermolecular hydrogen 
bond donor-acceptor pairs identified In complexes of proteins 
with non-halogenated ligands, in which the ligand is either a 
hydrogen bond donor (A, C, E) or acceptor (B, D, F); 
determined for non-halogenated, LH, (A, B), fluorinated, LF, 
(C, D), and otherwise halogenated (i.e. not fluorinated), LX, 
ligands (E, F). 
(TIF) 

Figure S2 Effect of a halogen atom on cumulative distributions 
determined for the donor-acceptor pairs determined for hydrogen 

bonds between ligand and protein backbone (carbonyl oxygen: 
A, C or amide nitrogen" B, D). The distributions estimated for 
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